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PREFACE 


This  work  was  done  during  1965  at  RCA  Laboratories  in  Princeton,  N.  J. 

It  is  part  of  our  continuing  research  in  the  area  of  question-answering 
processes  and  their  relationship  to  more  general  problems  in  machine  problem 
solving.  An  RCA  Laboratories  report  covering  most  of  the  material  described 
here  was  issued  in  November  1965  under  the  title  "Translation  of  Engllsh-llke 
Queries  into  Efficient  Computer  Search  Programs  for  Question-Answering." 

After  several  refinements  and  revisions  the  Initial  report  has  evolved  Into 
Its  present  form, and  It  is  being  finalist  1  for  publication.  Since  the  material 
presented  here  Is  relevant  to  much  ongoing  research,  we  are  simultaneously 
Issuing  It  as  a  scientific  report  In  order  to  speed  up  Its  availability  to 
the  technical  community. 

Saul  Amarel 
Princeton,  N.  J. 

May  1968 


11 


ABSTRACT 


This  paper  presents  a  technique  for  translating  certain  English-like 
questions  into  procedures  for  answering  then  in  order  to  explore  how  large  a 
class  of  basic  question  types  can  be  so  processed.  The  English-like  questions 
all  pertain  to  simple  diagrams  built  of  elementary  figures  with  relations 
like  "above"  and  "larger  than."  The  input  to  the  program  into  which  the 
algorithm  presented  here  could  be  implemented  are  questions  suck  as  "Is  it 
true  that  in  Fig.  1  each  triangle  is  above  a  circle,"  and  may  Include  terms 
like  "how,"  “when,"  ,lwhat"  in  an  interesting  variety  of  interrogative  rentence 
types.  The  output  of  the  program  is  a  fltrv  diagram  for  another  program  to 
answer  the  question  by  Inference  and  search  of  a  structured  data  base  in 
which  representations  of  diagrams  are  stored. 

The  English-like  source  language  of  questions  that  the  algorithm  can 
process,  though  restricted  and  fixed  in  syntax  and  dnmali.  of  discourse,  has  a 
potentially  wide  scope  in  that  it  includes  some  of  the  fundamental  question 
types. 
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I.  INTRODUCTION 


When  Turing  (38)  proposed  a  teat  for  distinguishing  between  the  verbal 
behavior  of  a  person  alleged  to  betoken  thinking  and  corresponding  behavior 
on  the  part  of  a  machine,  a  challenge  presented  itself  to  computer 
scientists.  How  can  a  computer  be  programed  to  answer  questions  whi.h 
resemble  more  and  more  the  questions  we  ordinarily  expect  that  only  people 
can  understand?  During  the  past  decade  several  dosen  question-answering 
programs  have  been  written,  e.g.,  (16),  (28),  (29),  (15),  (9),  (14),  (33). 
Concurrently  with  and  independently  of  the  work  reported  In  this  paper 
(completed  in  1965),  a  number  of  related  studies  appeared  (10),  (39), 

(35),  (37). 

A  recent  critical  review  (18)  of  this  literature  pointed  out  same 
major  gaps  in  the  theoretical  underpinnings.  This  review  concludes  that 
"the  only  hope  for  success  In  the  near  future  Is  In  well-structured  data¬ 
base  systems,  having  a  special  Internal  structure  appropriate  to  a  specific 
field,  a  reliable  technical  language,  and  a  competent  Inference  mechanism." 

Presented  here  Is  an  algorithm  for  processing  Engllsh-llke  questions. 
"Processing"  as  used  above  means:  (a)  parsing  the  question  to  analyse 
its  syntactic  structure;  (b)  building  up,  as  a  by-product  of  parsing, 
and  simultaneously,  the  flow-diagram  for  a  computer  program;  (c)  the 
program  If  run,  would  search  a  structured  data  base  for  the  answer  to 
the  question. 

A  typical  question  is:  "Is  it  true  that  in  Fig.  1,  each  circle  which 
Is  inside  a  triangle  is  above  a  rectangle."  The  domain  of  discourse  to 
which  all  questions  and  the  data  base  la  restricted  consists  of  single 
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diagram*  composed  of  geometric  figure*  arranged  into  various  pattern*  like  that 

* 

shown  in  Fig.  1.  A  data  base  of  such  figures,  each  represented  for  storage 
by  descriptive  forms  like  H  (3,  V  (4,5),  6))  for  the  following  figure. 


for  example,  la  well  structured.  The  questions  can  be  posed  in  teres  of 
a  minimum  vocabulary  Including  the  predicates;  H(a,h)  ■  "a  is  to  the  left 
of  b".  V(o,b)  -  "a  is  above,  b"  and  I(a,b)  »  "a  encloses  b",  with  a  and  b 

being  the  names  of  individual  elementary  figures.  Our  aim,  however,  is 
to  use  the  maximum,  not  the  minimum,  vocabulary  for  basic  concepts  that 
are  coaann  to  many  ordinary  questions,  and  to  explore  how  large  a  class 
of  question  types  ths  algorithm  can  process. 

The  rules  of  inference  used  here  are  those  of  a  dlasslcal  applied 
predicate  calculus,  because  nothing  sort  sophisticated  (e.g.,  modal  logic) 
le  required  for  the  purpose  on  hand. 

This  seems  to  have  been  used  first  by  MlnsVy,  end  subsequently  in 
(30).  (20),  (21),  (2?),  (1),  (27),  (32). 
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While  neither  the  cleee  of  problem  studied  here  (.19),  (20),  nor  the 
methods  of  syntactic  analysis  and  the  accompanying  translation  techniques 
(35),  (1),  (2),  (6),  (7),  (10),  (34),  (19),  nor  the  uae  of  rules  of  Inference 
(3-2),  (28)  are  novel,  their  conblnetlon  Is,  and  may  suggest  many 
Interesting  extensions. 
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XI.  TBB  SOURCE  LANGUAGE:  VOCABULARIES  AND  FIRST  RULES  OF  FORMATION 


To  specify,  as  a  formal  language"  Che  class  of  questions  which  the 
slgorltha  processes  Is  to  specify  the  quintuple:  {V^,  V^,  Q,  R.  T}.  Here: 

VT  lu  the  set  of  words  and  phrases  -  the  vocabulary  -  with  which 
the  questions  are  composed. 

Vjj  is  the  set  of  "nonterminal"  auxiliary  symbols  In  terms  of  which 
the  formation  rules  for  generating  the  questions  are  expressed. 

Q  la  a  special  dement  of  V^,  which  may  be  thought  of  as  labeling 
the  class  of  well-formed  questions. 

R  is  the  collection  of  formation  rules,  which  look  like 
Q— MQTRUE)  (SENT)  or  QTRUE — >  Is  It  true  that. 

Such  rules  tell  us  that,  from  the  symbol  at  the  left  of  the  arrow 
we  may  produce,  generate,  form,  the  "string"  of  symbols  at  the  right.  All 
words  in  capitals,  which  may  be  enclosed  In  parentheses,  like  Q,  QTRUE, 

SENT  sre  elements  of  V^;  each  Is  thought  of  as  a  single  symbol.  He  can 
form  strings  by  concatenating  -  placing  next  to  each  other  -  such  symbols, 
being  sure  to  preserve  the  order  In  which  they  appear.  In  a  rule  of  the 
second  type,  a  phrase  which  Is  not  all  capitals  appears  to  the  right  of 
the  arrow.  It  Is  an  element  of  V^. 

T  is  the  collection  of  transformation  rules,  to  be  explained  later. 
Beginning  with  the  symbol  Q,  the  rules  R  are. applied  until  only  elements  of 
appear  In  the  resulting  string;  l.e.,  until  all  non-terminal  symbols 

This  method, or  variants  thereof,  has  bean  used  in  combinatorial  lin¬ 
guistics  (19),  in  syntax-directed  compilers  (17),  In  mechanical  translation 
and  language  processing  (4)  (22)  to  mention  but  a  few  of  the  applications 
of  this  formal  approach  to  linguistic  systems  first  Introduced  in  this  way 
by  logicians  (6)  (though  some  of  the  Ideas  stem  from  the  17th  Century)  and 
further  developed  by  linguists  (8) . 
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like  QTRUE,  SENT  etc.,  have  been  replaced  by  terminal  vocabulary  elements 
according  to  rules  that  allow  this.  The  set  of  all  questions  that  may  be 
formed  this  way  from  the  graonar  G  *  V^,  Q,  R,  T)  Is  called  a  formal 

language,  Lg. 

The  reverse  process  of  generating  all  the  sentences  of  J«_  from  the 

U 

starting  symbol  Q  la  called  parsing.  Beginning  with  a  string  of  elements 
in  Vj  -  a  candidate  question  -  u<  seek  rules  In  R  or  T  and  try  to  apply 
them  In  a  suitable  order  so  that  the  given  sentence  could  have  somehow 
been  generated  from  Q. 

The  grammar  G  for  the  source  language  we  present  next  Is  restricted 
In  comparison  with  a  grammar  for  English  or  even  for  other  source  languages 
of  question-answering  algorithms,  with  regard  to  Its  syntax.  Our  aim, 
however,  la  to  Introduce  elements  of  which  represents  concepts  that  are 
fundamental  to  the  content  of  a  great  variety  of  questions.  That  la, 
t!  ov /:i  the  class  of  questions  In  are  syntactically  .*.11  very  sirtlnr,  they 
can  vary  greatly  In  thc*r  logical  content. 

In  Tables  I  and  II  we  present  the  vocabularies  V  and  V  .  The  rules 

X  |l 

are  numbered  F  1  to  F  11  and  each  Is  subdivided,  as  F  4.1,  F  4.2,  etc. 

Next  to  each  vocabulary  element  Is  Indicated  the  rule  In  which  It  Is  used. 
Note  rules  labeled  T  2.15, etc.,  appear  also.  These  are  transformation  rules 
described  later. 

We  proceed  next  to  R,  the  set  of  formation  rules. 

(F  10.4)  Q - ^  (QTRUE) (SENT) 

The  name  or  label  of  the  rule  by  which  we  refer  to  It  Is  written  In  paren¬ 
theses  at  the  left.  We  cannot,  of  course,  exemplify  the  use  of  this  rule 
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TABLE  I 


TABLE  II 


V, 

• 

V 

T 

! 

M 

Element  (W_ord_or 

Plirasc) 

Rule  Usln£ It 

Element  (Symbol) 

Mnemonic  Aid 

Rult  Uilnn  It 

above 

r  3.1 

to  inter- 

after 

r  1.21 

pretation 

and 

r  l.g 

AFT 

after 

F  10.17,10.19 

an. 

r  i.3 

AND 

and 

F  4.5,10.5 

a 

T  2.15 

ANT 

antecedent 

F  5. 2, 5. 3,6. 1,6. 2f 

before 

T  1.22 

■EF 

before 

F  10.17,10.20 

below 

F  3.2 

CHOC 

changes  occur 

F  10.14 

changes  occur 

F  1.17 

CL 

clauaa 

F  6.2 

circle 

T  2.1 

CONS 

consequent 

F  4. 3, 4. 5,4. 6, 7.1, 

circular 

F  3.13 

9,10.1,14.2 

darker  than 

F  3.7 

T  1.2, 1.3 

decreased 

T  2.10 

DO 

do 

F  10.17 

decreases 

T  2.8,  2.6 

FIC 

figure 

F  5.1 

decrease 

I  2.9,  2.16,  2.12 

FRW11 

from  where 

F  11.1,11.3 

decreasing 

T  2.13,  2.11 

IF 

if 

F  10. 8, T  1.2 

did 

T  2.12 

IMP 

Implication 

F  10.9,10.10 

do 

F  1.24 

IN 

In 

F  5.1 

each 

F  1.6 

IS 

la 

F  10.1,7.1,9,10.17 

enclosing 

F  3.5,  TR  3,5 

10.13 

figure 

F  1.3 

MI 

name  of  an 

from  where 

F  1.18 

individual 

F  4.1,5.1,10.1, 

bow 

F  1,15 

10.12,10.3 

if 

F  1.10 

NOT 

not 

F  4.3,10.7 

increased 

T  2.10 

OR 

or 

F  4.6,10.6 

Increases 

T  2.8,  2.5 

P0SFR2 

2-place  po¬ 

Increase 

T  2.9,  2.14,  2.12 

sitional 

increaclng 

T  2.13,  2.11 

predicate 

F  3.1,  F  3.2 

inside 

F  3.6 

POST 

poet-fix 

F  10.15 

in 

F  1.2 

FRE 

pra-fix 

F  10.14 

1.  it  falsa  that 

T  2.3 

QUAL 

qualifier 

F  10.2-4,11.3,11./ 

la  it  true  that 

F  1.1 

T  1.1,1. 2, 1.3, 2. 4 

la  there 

F  2.4 

QUANT 

quantifier 

F  4.2,  T  1.2 

is 

F  1.4 

QUA 

universal 

larger  than 

F  3.9 

quantifier 

F  5.3,10.10 

lighter  than 

F  3.8 

QUE 

exist  partial 

noved 

I  2.1'. 

quantifier 

F  5.2,10.9,  T  1.3 

moves 

T  2.3,  2.7 

QTRUF. 

la  It  true 

wove 

I  2.9,  2.14,  2.12 

that 

F  10.7,10.4,10.3. 

■ovlng 

T  2.11,  2.13 

T  1.1 

not 

F  1.7 

SENT 

sentence 

F  10. 1.10.21  T  1  .1 

no 

T  2.2 

F  11.1-4,  T  1 .1 

object 

F  8.1 

SIIFK1 

1-place  sharp 

or 

F  1.9 

predicate 

F  4.4 

rectangle 

T  2.1 

STDSENT 

"this  sentence 

rectangular 

F  3.12 

la  uttered, 

related  to 

F  1.16 

standard 

•sailer  than 

F  3.10 

sentence 

F  10.18-21 

then 

F  1.11 

THEN 

then 

F  10.8 

this  sentence  Is 

uttered 

F  1.25 

T0VH 

to  where 

F  11.2,11.4 

to  the  left  of 

F  3.3 

VAR 

variable 

F  6.1,  3  1.3 

to  the  right  of 

F  3.4 

VUA 

what 

F  10.12,  T  1.3 

to  where 

F  1.19 

WHI 

while 

F  10.18,10.21 

trlanele 

T  2.1 

WIN 

when 

F  10.16,10.1 7 ,10. 1 

triangular 

F  3.11 

W 

which 

F  7.1  T  1.3 

what 

F  1.13 

when 

F  1.20 

where 

F  1.14 

which 

F  1.12 

while 

F  1.23 

will 

I  2.9 
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until  we  have  Introduced  enough  rules  to  define  (specify  what  we  can  sub¬ 
stitute  for),  QTRUE  and  SENT.  The  symbol  SENT  Is  a  mnemonic  label  for 
all  well-formed  sentences,  for  completeness,  however,  we  Introduce  two 
variants. 

(F  10.2)  Q.«— (QTRUE)  (QUAL)  (SENT) 

(F  10.3)  Q - f  (QUAL)  (OTRUE)  (SENT) 

(r  5.1)  QUAL— >  (IN) (FIG) (NX)  ;  OUAL  Is  mnemonic  for 

"qualifying  clause" 

The  next  feu  rules ,  all  labeled  (F  1 .X) ,  relate  to  terminal  vocabulary 
words  and  phrases.  We  Introduce  only  a  partial  list  at  this  point,  so 
that  we  can  Illustrate  their  use  and  justify  their  selection. 

(F  1.1)  QTRUE— >  Is  It  true  that 

(F  1.2)  IN  — ^  In 

(F  1.3)  TIG  — ■=>  figure 

(F  1.4)  IS  — >  Is 

Rules  labelled  (V  2.X)denotc  all  the  individual  constants:  the  under¬ 
lined  numbers  labelling  unique,  specific  geometric  object  like  circles, 
rectangles  of  which  the  patterns  are  built.  No  two  circles,  for  example, 
are  given  the  same  label,  and  all  the  figures  which  constitute  patterns 
enclosed  In  a  rectangular  frame,  are  also  labelled:  Figure  1_,  etc. 

(F  2.1)  NI  — >  1  1  -  1,  2,  3  - ... 

we  used  underlined  numbers  to  label  or  refer  to  our  figures  and 
building  blocks  or  primitive  objects. 

The  next  set  of  rules,  like  (F  l.X)  and  (F  2. X),  also  point  to  terminal 
words  and  phrases,  but  only  to  those  indicating  relations,  or  predicates. 
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with  which  sentences  -  the  assertive  parts  of  each  question  -  are  forasd. 

The  first  is: 

(F  3.1)  P0SPR2  * - >  above.  (P0SPR2  Is  mnemonic  for  a  2- 

place  position  predicate) 

We  can. now  illustrate  the  use  of  the  rules  presented  no  far.  Applying 
rules  (F  0.1)  (F  1.1)  produces,  starting  with  Q, 

(Is  it  true  that)  (SENT) 

We  now  jump  to  a  rule  for  aentencehood : 

(F  10.1)  SENT  - — >  (Nl) (IS) (CONS) 

(F  4.1)  CONS  . — >  (POSPR2) (NI) . 

Substituting  into  (F  10.1)  the  result  of  rule  (F  2.4)  and  into  (F  4.1) 
the  result  of  (F  2.6),  we  have  SENT  — (4)  (IS)  (P0SPR2)  (6).  Applying 
(F  3.1)  and  (F  1.4)  and  combining,  we  have 
"Is  it  true  that  4^  is  above  6". 

It  is  a  well-formed  sentence.  With  respect  to  Figure  2_,  It  can  be 
answered  "Yes".  With  the  help  of  rules  (F  10.2),  (F  1,2),  (F  1.3)  and 
(F  2.2),  we  could  also  have  formed  the  more  precise  question: 

"la  It  true  that  In  Figure  2  Is  above  6^". 

Had  we  used  rule  (F  10.3)  rather  than  (F  10.2)  In  the  above  production 
process,  we  could  have  obtained: 

"In  Figure  ,2.  1*  It  true  that  4^  la  above  6_".  This  question  is 
related  to  the  one  juat  above  by  a  simple  transformation.  We  can  exprsss 
it  as: 

(T  1.1)  (Q0AL) (QTRUE) (SENT)<— >  (QTRUE) (QUAL) (SENT). 

Clean  (F  10.4),  (F  10.2),  and  if  vs  have  (T  1.1),  we  no  longer  need  (F  10.3). 
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The  set  T  of  transformation  rules  are  all  like  (T  1.1)  and  serve  to 
render  the  questions  generated  by  T  wot e  English-like  and  to  avoid  very 
sideward  phraseology. 
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III.  TK  SOURCE  LANGUAGE:  FURTHER  SPECIFICATIONS  FGR  ASSERTIVE  QUESTIONS 


By  an  assertive  question  we  mean  one  generated  by  rule  F  10.2.  It 
begins  with:  "Is  it  true  that...".  Rule  (F  4.1)  has,  so  far,  only  enabled 
us  to  suite  assertions  about  individual  objects.  To  be  able  to  pose  a 
question  like,  "la  it  true  that  4^  is  above  every  circle",  we  pick  QUANT 
a  (non-terminal)  label  for  the  class  of  phrases  like  "every  circle".  It 
is  used  in: 


(F  4.2) 

CONS—*  (P0SFR2) (QUANT) 

(QUANT  is  mnemonic  for  a 
clause  with  quantifiers) 

(F  5.2) 

QUANT — >  (QUE)  (ANT) 

(F  5.3) 

QUANT  — >  (OUA)  (ANT) 

(F  6.1) 

ANT  — >  VAR 

(ANT  is  mnemonic  for  "antecedent") 

(F  6.2) 

ANT  - >  (ANT)  (CL) 

(CL  reads  "Clause") 

<F  7.1) 

CL  — >  (W) (IS) (CONS) 

(F  8.1) 

VAR  — >  object. 

The  word  "object"  is  generic. 

the  word  "variable"  for  which  VAR  is  mnemonic.  We  might  have  chosen  to 

"object"  as 

a  one-place  predicate.  Since 

our  universe  of  discourse 

however  consists  only  of  simple  geometric  figures  which  are  individually  labelled, 
the  word  object  is  generic  to  these  labels.  That  is,  JL,  •••  are 

specific  to  the  word  "object". 

Ha  now  add  to  our  previously  started  partial  list  of  rules  that  relate 
to  the  terminal  vocabulary-  (F  l.X)  for  function  words  and  (F  3.X)  for 
predicates : 


(F  1.5) 
(F  1.6) 
(F  1.7) 
(F  1.8) 
(F  1.9) 
(F  1.10) 
(F  1.11) 
(F  1.12) 


(the  existential  quantifier) 
(the  universal  quantifier) 


which 


This  list  will  be  coapleted  in  the  section  where  the  additional  ones 


are  used. 

So  far.  we  could,  from  the  sequence  of  rules  (F  10.1),  (F  2.1),  (F  1.4), 
(F  4.2),  (F  3.1),  (F  5.2),  (F  1.5),  (F  6.1),  and  (F  8.1)  fore  the  sentence 
(starting  with  SENT):  "1  is  above  an  object."  If  we  wanted  to  improve 
this  stylistically,  we  would  introduce  the  transformation  "an  object" *■  ■■ 


"something".  Next,  we  Introduce  more  predicates: 


(F  3.2)  P0SPR2 


(F  3.3)  P0SPR2 
(F  3.4)  P0SPR2 
(F  3.5)  P0SPR2 


below 


to  the  left  of 


to  the  right  of 
enclosing 


(another  2-place  position 
predicate) 


(F  3.6)  P0SPR2 - >  inside 

The  reader  can  readily  verify  that  from  SENT,  with  the  help  of  rules 
(F  10.1)  and  (F  4.2),  and  other  rules  we  can  form  sentences  like: 

S^:  1.  is  above  each  object 

S^:  1^  is  above  each  object  which  is  enclosing  2. 
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Sj:  JL  is  below  ssch  object  which  is  inside  esch  object  which  is  above  2,. 

S^t  JL  is  below  sa  object  which  is  to  the  left  of  esch  object  which  Is 
enclosing  an  object. 

Xt  is  essy  to  see  that  an  infinite  number  of  well-formed  sentences  can 
thus  be  generated,  because  there  is  no  restriction  on  how  often  rules 
(F  6.2),  (F  7.1)  and  (F  4.2)  can  be  reapplied.  In  s  figure  with  e  finite 
number  of  objects,  only  a  finite  number,  however,  can  be  materially  true, 
if  redundancies  are  not  counted.  The  clause  "object  which  is  1_"  can  ob¬ 
viously  be  replaced  by  "l" .  Some  sentences  express  impossible  configurations 
like  "1,  is  above  each  object,"  because  1^  cannot  be  above  Itself. 

(F  3.7)  DKFR2  - - >  darker  than  (a  2-place  intensity 

predicate) 

(F  3.8)  DKPR2  - >  lighter  than 

(F  3.9)  SZPR2  - >  larger  than  (a  2-place  slse  predicate) 

(F  3.10)  SZPR2  7  smaller  than 

(F  3.11)  SHPR1  - >  triangular  (a  1-place  shape  predicate) 

(F  3.12)  SHPR1  — rectangular 
(F  3.13)  SHPR1 - >■  circular 

Quite  a  variety  of  interesting  sentences  can  be  generated  at  this  point, 
but  they  are  stylistically  awkward.  The  following  transformations  are  of 
general  value  In  producing  more  Engllah-llke  sentences,  and  they  improve 
the  sentences  producible  so  far. 

T  1.2.  (QUAL) (QTRUE) (QUANT) (IS) (C0NS)4“^  (QUAL) (IS) (QUANT) (CONS) 
Kxsmple:  "In  Figure  1.  is  it  true  that  each  circle  which  is  inside  a  triangle 
above  a  rectangle"  <  ^  "In  Figure  JL  is  each  circle  which  is  inside 
a  triangle  above  a  rectangle". 
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(T  2.1) 

(T  2.2) 
(T  2.3) 
(T  2.4) 

(T  2.15) 


{eirc  "S  /eirc 

rectangv  ularOj  rec  tangS-  1* 
erlang jf  Utriang  J 

not  an  4-9  no 

not  la  It  true  that  f41i  It  falae  that 


(QTRUE)  (QUAL)  an  object  la  «-»l»  there  an  object  (QUAL) 
/circle  ^circle 


an  J  rectangle/^-^a  J  recta.igleS- 
/ circle  y  /triangle y 


The  following  formation  rulea  permit  us  to  enrich  our  class  of  questions 
by  Incorporating  the  main  devices  of  ordinary  propositional  logic. 

(F  4.3)  CONS  - =>  (NOT) (CONS) 

Together  with  (T  2.2),  this  rule  could  generate  "above  no  object", 

"above  no  circle". 

->  SHPR1 

•>  (CONS) (AND) (CONS) 

->  (CONS)  (OR)  (CONS) 

jy  (ANT) (IS) (CONS);  IMP  is  mnemonic  for 
"Implication",  and  If  It  Is  recalled  that  ANT  suggests  "antecedent" 
and  CONS  "consequent",  this  rule  will  be  seen  to  stand  out  as  one  of  the 
aest  essential  and  powerful. 

We  finally  augment  our  rules  for  sentence  formation: 


(P  4.4) 
(F  4.5) 
(P  4.6) 
(P  9) 


CONS 

CONS 

CONS 

IMP 


(P  10.5) 

SENT 

- => 

(SENT) (AND) (SENT) 

(P  10.6) 

SENT 

- > 

(SENT) (OR) (SENT) 

(P  10.7) 

SENT 

- > 

(NOT) (SENT) 

(P  10.8) 

SENT 

- > 

(IF) (SENT) (THEN) (SENT) 

(P  10.9) 

SENT 

- - > 

(QUE) (IMP) 

(P  10.10) 

SENT 

_ > 

(QUA) (IMP) 
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_ _  -TS 


The  following  ir.  a  sample  of  10  questions  which  can  be  generated  from  the 

rules  stated  up  to  this  point. 

E.l.  In  Fig.  1.  is  each  circle  which  inside  a  triangle  above  a  rectangle 

E.2.  la  there  an  object  in  Fig.  1^  which  is  inside  a  circle  which  is  lnalde 

a  rectangle 

E.3.  Is  it  false  that  in  Fig.  1_  a  circle  is  above  each  triangle 

E.4.  la  it  true  that  in  Fig.  1  no  circle  which  is  larger  than  a  circle 

lnalde  a  triangle  is  to  the  left  of  a  triangle  which  is  above  a 
rectangle 

E.5.  If  each  circle  la  larger  than  2  then  is  it  true  that  in  Fig.  1  each 
circle  is  above  a  triangle. 

E.6.  la  it  true  that  in  Fig.  .1  if  each  triangle  is  above  a  circle  then  if 
each  triangle  is  inside  a  rectangle  than  a  circle  is  Inside  a  circle 
then  a  circle  la  larger  than  3.* 

E.7.  Is  it  true  that  in  Fig.  1_  a  triangle  is  above  a  circle  or  a  triangle 
is  below  a  circle. 

E.S.  Is  it  true  that  in  Fig.  1_  each  circle  is  inside  a  triangle  and  each 
circle  is  above  each  triangle. 

E.9.  Is  it  true  that  in  Fig.  1_  each  rectangle  which  Is  inside  a  circle 
and  to  the  left  of  a  triangle  which  is  inside  a  circle  is  larger 
than  3. 

E.10.  Is  it  true  that  in  Fig.  JL  each  circle  which  is  larger  than  2 

smaller  than  4  is  lnalde  a  circle  which  Is  larger  than  4  and  smeller 
than  3. 
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Note  that  In  E.9  we  cannot  specify  whether  the  triangle  le  to  be 
Inside  the  same  circle  as  the  rectangle.  To  do  this  we  should  say  "Inside 
the  circl*."  or  "inside  that  circle"  In  place  of  the  second  occurrence  of 
"inside  a  circle."  This  is  an  important  device  which  has  been  dealt  with 
by  P.ohncrt  (5)  . 


Note  that  E.10  can  be  answered  "No"  as  If  It  were  an  analytic 
sentence.  To  transform  E.10  Into  an  analytic  sentence  In  which  a  logical 
contradiction  Is  formally  derivable,  It  Is  necessary  to  substitute  formal 
definitions  for  "inside',"1  "larger  than"  and  "smaller  than."  One  way  to  do 
this  Is  to  introduce  special  "inference  rules"  which  are  implicit  In  the 
data  structure  or  In  the  algorithms  for  searching  the  data  structures.  For 
example,  each  of  the  10  dyadic  predicates  Introduced  so  far  are  transitive. 
Rule  1:  Let  p  denote  any  such  predicate,  e.g.  "above."  If  x,  y,  s  denote 
any  three  objects,  and  "x  Is  py,"  "y  is  pi"  are  both  true,  then  "x  is  pr" 

Is  also  true. 

Rule  2:  If  "x  Is  inside  fy"and  "x  enclosing*"]  *"  *re  both  true*  •nd 

x,  y,  s  are  all  three  circles  or  rectangles  or  triangles,  then  "y 

fls  larger  than]  „  .  ,  . 

[is  enclosing  J  *  -u,t  «l,°  be  true’ 

Rule  3:  If  "x  Is  Inside  y"  and  "y  Is  qz"  where  q  stands  for  any  of  the 
ten  predicates  except  "darker  than,"  "lighter  than,"  "larger  than,"  "en¬ 
closing,"  then  "x  Is  qz"  must  be  true. 

If  sentences  derivable  from  these  rules  are  combined  by  AND  the 
(declarative)  sentence  part  of  an  assertive  question-sentence  can  be  shown 
to  be  contradictory.  That  Is,  SENT  has  the  form  (s) (AND) (NOT) (s)  where  s 
has  the  form  SENT. 


15 


IV.  DOMAIN  UPON  WHICH  QUESTION -ANSWERING  PROGRAMS  OPERATE 


The  algorithm  being  described  produces,  for  each  question  In  the  source 
language,  a  flow  diagram  for  a  computer  program.  The  computer  program.  If 
run,  would  search  a  data  base  for  the  correct  answer  to  the  question.  The 
answer  is  obtained  cither  by  direct  lookup  or  by  Inference  from  what  can 
be  looked  up.  The  rules  of  Inference  are  embedded  In  the  rules  which  translate 
questions  of  the  source  language  Into  flow  diagrams. 

To  fix  Ideas,  let  us  adopt  a  particular  symbolic  form  for  representing 
stored  data  sa  seen  by  a  programmer.  The  data  base  consists  of  a  list  of 
descriptions  of  labelled  figures.  The  figures  are  labelled  -  numbered  - 
In  the  temporal  order  in  which  they  are  added  to  update  the  data  base.  Each 
figure  can  be  described  by  several  different  methods.  One  la  graphical, 
for  easy  visualisation,  aa  Figs.  1  and  2  at  the  beginning  of  this  paper. 

To  describe  the  essential  features  of  such  a  diagram  completely  enough 
to  make  possible  the  answer  of  all  questions  in  the  source  language,  we 
must  adopt  some  representation.  The  choice  of  representation,  particularly 
with  regard  to  Its  effectiveness  and  efficiency  for  updating  and  answering 
of  questions  requiring  inferences  la  a  fundamental  problem  (3)  (39)  but  not 
focal  to  this  paper.  In  which  we  aim  at  probing  the  non-syntactic  boundaries 
of  our  source  language.  The  representation  for  storing  diagram  daacrlptiona 
wa  chose  takes  advantage  of  the  fact  that  all  the  two-place  relations  we 
used  (la  above,  la  to  the  right  of,  la  Inside,  la  darker  than,  la  larger 
than)  art  transitive.  To  make  this  precise,  let  us  symbolise,  to  indicate 
to  a  programmer  how  to  encode  It  for  storage,  our  predicates  in  the 
terminal  vocabulary  as  follows: 


H  ’(l.D 


"i  is  immediately  to  the  left  of  ■  "J[  Is  immediately  to 
the  right  of  i" 

H  (i^J^  *  *4  is  somewhere  to  the  left  of 

E  H  (i,JL)  “  '4  is  not  to  the  left  of  and  not  to  the  right  of  j_" 

V  «  '4  Is  Immediately  above  is  just  below  i_." 

V  (i^,J[)  »  '4  is  somewhere  above 

E  V  (i.,i>  "  "1  is  not  above  and  not  below  J_" 

I  '  (i,J)  "  "1  is  immediately  inside  V'  -  '4  is  Just  enclosing 

I  (i.JL)  "  '4  is  enclosing  J/'  ■  "j.  is  somewhere  Inside  i" 

E  X  (i^J)  •»  '4  is  not  enclosing  and  not  Inside  J" 

D  '  (i_.l)  "  "i  is  Just  a  shade  darker  than  J_" 

D  '(i.D  -  "i  is  darker  than  i” 

E  D  (i«i)  *  "i  *»  »ot  darker  than  and  not  lighter  than 

S  ’<i,D  ■  '4  is  Just  larger  than  i" 

5  (i.i>  "  "i  i»  larger  than  J." 

E  S  (i.,i)  "  '4  la  not  larger  and  not  smaller  than  J[" 

*  (i)  *  '4  is  rectangular" 

T  (i)  "  "i  la  triangular" 

C  (i)  -  "i  la  circular" 


To  completely  represent  Pig.  2,  we  would  use  the  expression: 

FIG.  2  -  l'(2,  H'(3,  V (!•(«, 5) ,6)))  S  EH  (4,6)  &  T(3)  &  R(4)  6  C<5)  &  T(6) 

The  &  means  "snd";  it  takes  all  of  these  statements  to  describe  the  figure 


to  wit 


Iirect  horizontal  and  vertical  alignments.  That  is,  (  ,{jJ 

[7 

are  considered  identical,  snd  can  both  be  represented 
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by  H'  (i_,J^  &  EV  We  would  call  1.  above  1  only  If  each  point  of  1^ 

la  above  each  point  of  J^.  We  would  represent  CU  by  H ' (l,i) 

*V’(i,i).  _ [I[ 

To  answer  the  question,  "In  Fig.  2_,  is  It  true  that  5^  is  Inside  i", 
we  Interpret  "Inside"  to  be  either  "Just  inside"  or  "somewhere  Inside." 

We  search  the  data  base  for  a  description  of  Figure  2_,  by  going  through  2^ 
as  an  index  terra  which  points  to  the  location  where  the  description  Is  stored. 
The  description,  a  coded  version  of  Fig.  2,  Is  retrieved  and  scanned  for 
I’(4,3).  Since  this  is  readily  found,  the  search  ends  with  a  positive 
answer. 

To  answer  the  question,  "In  Fig.  2,  la  it  true  that  5  is  above  6" 
requires  a  more  subtle  search  through  the  expression  in  Fig.  2.  The  search  la 
completed  with  a  positive  answer  either  if  we  find:  (a)  V'(£, 6)  or 
(b)  V(_5 ,6J  ;  rules  of  Inference  applying  to  these  representations  also 
permit  us  to  add  conditions  like:  (C)  V ' ( 1 1  (i,5J  ,6)  es  well  as  many  other 
condltlcna  which  follow  from  an  Inference  rule  like;  "any  object  inside 
an  object  above  e  third  object  Is  above  that  third  object."  All  these 
conditions  ere  equivalent  to  V(S.,6) . 

We  assume  that  subroutines  for  searching  e  string  like  Fig.  1  for 
the  truth  or  falsity  of  are  stored,  can  accept  es  Input  any  such 


string,  and  produce  es  output  either: 

"Tee,  V(1,J)  is  true  for  Input  string  FIG.  1"  or 
"Wo,  V(l^J)  Is  not  trus  for  input  string  FIG  1." 

We  represent  this  subroutlns  by  the  following  fragment  of  e  flow 


diagram: 


Fig.  i. 


18 


The  domain  upon  which  the  quest lon-anawerlng  program  operates,  then. 

Is  the  set  of  all  acceptable  strings  like  Fig.  1.  The  following  simple 
rules  of  formation  can  characterise  this  set: 

S  ■  — »  E/P1(N)/S4S,  when  S  is  mnemonic  for  the  set  of  well- 

formed  strings.  The  slash,  /,  means  alternatives,  l.e.,  "or." 

E - P2(A,A) 

P2 — *  H’/V'/I'/EH/EV/EI  as  defined  before 

A  - >  N/E 

N  — >  1/ 2/3/4/ 5/... 

PI— >  R/T/C  as  defined  before. 

We  could  develop  a  grammar  for  generating  Engllsh-llke  declarative 
sentences  corresponding  to  these  strings,  and  these  declarative  sentences 
would  all  be  such  simpler  than  the  questions  in  our  source  lnneuage  because 
they  need  only  provide  minimal  irredundant  descriptions  of  tin.  figures. 

The  language  generated  by  this  simple  grammar  is  Infinite  because 
both  E  and  S  can  appear  Infinitely  often  through  the  iterated  use  of  rules 
(lc),  (2)  and  (4b).  That  is,  configurations  of  an  arbitrarily  large  number 
triangles,  circles  and  rectangles  of  varying  size  and  Intensity  combined 
In  an  arbitrarily  large  and  complex  number  of  arrangements  can  constitute 
any  one  figure  being  stored.  The  data  base  Is  a  stored  corpus  of  such 
figures. 
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V.  TRANSLATING  ASSERTIVE  QUESTIONS  INTO  FLOW  DIAGRAMS 


Given  a  question  in  the  source  language,  the  first  step  of  the 
algorithm  is  to  parse  the  question.  This  means  the  construction  of  a 
phrase-marker  or  labelled  bracketting  of  the  question  indicating  which 
rule  of  C  is  applied  when.  Let  us  illustrate  with  the  question: 

Q  “  "Is  it  true  that  in  Figure  J  4  is  enclosing  5/' 

We  scan  Q  from  left  to  right  until  we  find  a  string  which  matches  an  argument 
In  Table  I,  looking  for  the  longest  possible  match  (11),  (22).  Thus, 
though  we  can  match  "is"  as  the  4th  entry  in  Table  I,  we  match  on  "is 
it  true  that",  the  2nd  item  in  Table  I.  Table  1  tells  us  to  apply  rule 
(F  1.1),  which  is  QTRUE - — >is  it  true  that.  We  thus  bracket  "is  it  true 
that"  in  Q  and  indicate  that  it  may  have  been  generated  from  QTRUE. 

We  proceed  with  our  lef t-to-right  scan,  to  bracket  "in"  and  indicate 
that  it  was  produced  from  IN  by  rule  (F  1.2).  We  indicate  that  this  was 

our  second  step  by  the  circled  2.  As  we  proceed  to  step  7  in  this  way 

* 

we  encounter  rule  (F  3.5),  P0SFR2- — > enclosing ,  wc  note  in  Table  I  that 

there  is  an  accompanying  translation  rule  TR  3.5.  It  is  to  form: 

(TR  3.5)  Q 

-> 

rosi'R' 


1(5,0)  £ 


POSPR2  la  Che  name  of  this  box.  Its  Input  Is  labelled  Q  and  represents 
a  control  signal  to  Initiate  the  operation  of  searching  a  specified  string 
for  the  truth  of  V(s,o),  after  a  and  o  have  been  specified.  The  output 

la  a  decision,  a  conditional  transfer.  It  is  T  or  N,  depending  on  whether 
* 

Table  I  lists  the  translation  rule  for  only  "cnclosin!;,"  as  an  example,  but 
does  not  indicate  which  translation  rules  goes  with  each  word  or  phrase, 
if  any. 
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V(s,o)  Is  true  or  false.  Control  la  transferred  from  POSPR2  to  the  box  to 
which  the  Y  or  N  line,  whichever  la  activated,  points. 

After  we  scanned  Q  from  left  to  right  and  obtained  a  string  Q,  of 
non-tcrmlnal  symbols  from  which  Q  may  have  been  generated,  we  scan  Q'  from 
right  to  left.  In  step  9,  the  box  previously  named  P0SPR2  Is  renamed  CONS 
and  o  is  set  to  At  step  10,  the  box  Is 
set  s  to  4^.  At  step  11,  we  form  the  box -> 

At  step  12,  these  two  boxes  are  joined  to  produce: 


again  renamed  SENT  and  we 


Isearch  the  data  base  for  Figure  2 


Y 

N 


Q 


The  entire  flow  diagram  Is  now.  In  step  12,  by  (F  10.2),  Is  labelled  Q. 


Next  we  describe  the  translation  rules  accompanying  the  appropriate 
formation  rules. 


* 
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TR 

3.1 

(with 

TR 

3.2 

(with 

TR 

3.3 

TR 

3.4 

TR 

3.5 

TR 

3.6 

TR 

3.7 

TR 

3.8 

TR 

3.9 

TR 

3.10 

TR 

3.11 

TR 

3.12 

TR 

3.13 

nil*  (F  3.1)): 
rule  (F  3.2)): 


Fora 

Fora 

Fora  ' 
Fora  • 
Fora  ■ 
Fora  ■ 
Fora  - 
Fora- 
Fora- 
Fora  • 
Fora  - 
Fora  - 
Fora  - 


P0SPR2 

P0SPR2 

"F  (h(«.q)I  ^ 

->  m:  jz 


("I*  •  above  o") 

("la  a  below  o") 

("la  a  to  the  left  of  o") 

("la  a  enclosing  o") 

("la  a  darker  than  o") 

("la  a  larger  than  o") 

("Is  a  triangular") 

("la  a  rectangular") 

(la  a  circular") 


There  are  no  translation  rules  with  (F  l.X)  except  for  "an"  and  "each," 
For  (F  2.1).  which  la  Nl*-M,  we  have  rules  TR  2.1,  for  which  we  think  of 
1  as  a  value  of  HI.  We  nuaber  the  values  of  NI  as  these  occur  In  Q'  ,fron 
left  to  right.  TR  2H  tells  ua  to  store  the  value  of  MI  In  a  push-down 
Hat,  so  that  the  right-seat  occurrence  of  NI  la  on  top.  This  push-down 
is  used  In  many  other  translation  rules. 

TR  4.1  (goes  with  CONS — >F(P0SPR2)(NI)) .  P0SPR2  Is  the  naae  of  a  box 
formed  by  one  of  the  rules  TR  3.1  -  TR  3.10. 

We  now  substitute  for  o  the  value  of  NI  and  we 
renane  the  box  CONS. 

To  Introduce  the  other  translation  rules.  It  is  best  to  proceed  in  a 
certain  order,  beginning  with: 
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TR  8.1  (goes  with  VAR— ^object) .  Here,  as  In  TR  2.1,  we  fora  a  push-down 


list  x^,  Xj,  Xj,...,  assigning  x^  for  the  left-taost  occurrence  of 
the  word  "object"  as  we  scan  Q  from  left  to  right;  for  the 
second  occurrence  of  "object"  in  the  left-to-rlght  scan,  etc. 

TR  6.1  (goes  with  ANT — >VAR).  When  rule  F  6.1  Is  applied  to  Indicate  that 
ANT  generated  VAR,  we  assign  to  ANT  the  variable  of  the  top  of  the 
push-down  list.  We  call  this  its  wain  variable.  To  Illustrate, 


consider  the  phrase  (ANT),  "...object  which  Is  above  each  object..." 

_  .*  ,  I  I  I  I  I  I 

The  concept  of  VAR  W  IS  P0SPR2  QUA  VAR 

"■tain  variable"  will  \  \  \  \  \  *2 

be  used  later  in  binding  \  \  \  \ 

all  the  variables  by  ANT  \  \  \^QUANT  "" 

quantifiers.  \  \\  .C0NS 


aaln\var  ■  x 


1  CL 


TR  6.2  (goes  with  ANT — 5*- (ANT)  (CL))  .  If  the  box  marked  CL  has  s  In  It, 

substitute  for  s  the  main  variable  of  ANT,  say  x^.  Relabel  the 

new  box  ANT.  Its  main  variable  Is  also  x.  . 

k 

TR  7.1  (goes  with  CL — »» (W) (IS) (CONS) ) .  Substitute  CL  for  CONS  as  the 
label  of  the  box  and  make  the  main  variable  of  CL  the  same  as 


that  of  CONS.  First,  we  return  to 


TK  1.5  (with  QUF. — ^  an)  .  Form  the  box 


^  (QUE) ^1 


C  (QUE) 
n  ■ 


i 

’  QUE 

_ k 


CQ  (QUE) 


_ F»Cy  (QUE). 


This  Is  an  abbreviated  version  of  those  parts  of  the  flow  diagram 
duc  to  the  presence  of  a  quantifier  In  a  searrh  operation.  In 
more  detail  this  box  Is: 


Thla  concept,  like  aeveral  other*  pertaining  to  translation,  from  questions 
to  the  predicate  calculus,  la  described  in  unpublished  work  by  S.  Amaral. 
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The  input  C^CQUE)  la  a  control  signal  which  starts  the  QUE  routine 

by  initialising  a  search  through  a  previously  specified  list.  The  Input 

C  (QUE)  Is  the  control  signal  to  Initiate  a  test  to  determine  If  search 
n 

should  continue  or  cease. 

The  output  Cq(QUE)  is  the  signal  which  transfers  the  input  data  - 
the  query  specification  •  to  a  search  routine,  usually  to  a  box  labelled 
ANT.  The  output  C^CQUE)  la  a  signal  indicating  that  as  the  search  routine 
to  which  QUE  Is  coupled  succeeded  In  "matching"  the  Input  data  of  the 
question  with  something  In  the  list  being  searched. 

TR  1.6  (with  QUA-^each).  same  as  TR  1.5. 

TR  5.2  (with  QUANT — >(QUE>(ANT)) .  The  box  labelled  ANT,  like  all  boxes 
associated  with  a  search  routine  have  one  Input,  Q(ANT)  and  two 
outputs  T(ANT) 

and  N(ANT);  Q  signals  the  start  of 

searching;  Y  signals'  success  and  N  failure  in  searching.  The 
coupling  between  QUE  and  ANT  to  form  QUANT  Is  specified  by: 


Q(QUANT) 

-  C^QUE 

Q(ANT)  -  Cq(QUE) 

T (QUANT) 

-  Y(ANT 

N(ANT)  -  C  (QUE) 

n 

N(QUANT) 

-  Cy(QUE) 

Main  variable  of  QUANT  -  Main  var.  of  ANT 

This  may 

be  clarified  by 

the  diagram. 
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Fig.  II B 


In  terms  of  the  blow-up  for  QOE,  this  is 

Y(QUANT) 

(TT*.  ,  something 
Is  Inside  4) 

N  (QUANT) 

(s.g.  No,  nothing 
Is  Inside  4) 
- 

Fig.  IIC 

TR  5.3  (with  QUANT — XQUA)  (ANT) ) 

Q(QUANT)  -  CX(QUA)  Q(ANT)  -  Cp(QUA) 

T(QUANT)  -  C  (QUA)  Y(ANT)  -  C  (QUA) 

n 

N(QUANT)  -  N(ANT)  Mean  Variable  of  QUANT  -  Main  Var.  of  ANT 

We  show  the  connections  In  order  to  explain  the  next  rule,  TR  4.2, 
as  well. 
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TR  4.2  (with  CONS— >  (POSPR2) (QUANT)).  Here  the  connection  marked  X 
In  the  above  diagram  of  the  QUANT  box  which  was  just  msde  In 
combining  QUA  with  ANT,  la  broken  to  allow  an  Insertion.  This 
also  applies  to  the  QUANT  diagram  for  (QUE) (ANT) .  We  replace  the 
coupling  expressed  by  Q(ANT)  ■  Cq(QUA)  (or  *  Cp(QUE))  by 
Q(POSPR2)  -  Y(POSPR2).  In  addition  we  have: 

Q(CONS)  -  Q(QUANT)  T(CONS)  -  Y  (QUANT) 

N(CONS)  -  N (QUANT)  -  N(P0SPR2) 

Replace  o  In  P0SPR2  by  the  main  variable  of  QUANT,  and  make  this 
also  Che  main  variable  of  CONS. 

TR  A. 3  (with  CONS — (NOT) (CONS)) 

This  Is  a  simple  relabeling  or  Inversion  of  output  leads. 

Y( (NOT) (CONS))  -  N(CONS) ;  N( (NOT) (CONS))  -  Y(CONS) 

Q( (NOT) (CONS))  -  Q(CONS) 

An  analogous  rule,  T  10,7,  also  holds  for  SENT -^(NOT) (SENT) 

TR  4.4  (with  CONS  — >SHPR1) 

Change  the  label  of  the  box  from  SHPR1  to  CONS  and  let  the 
mein  variable  of  CONS  be  that  of  SIIPRl;  namely  s. 
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TR  4 . 5  (with  CONS  — >  (CONS) (AND) (CONS) )  . 

This  rule  is  identical  with  TR  10. 5,  that  corresponding  to 
SENT  — ^  (SENT) (AND) (SENT)  and  la  obvious  to  anyone  familiar  with 
propositional  logic.  It  is  best  stated  In  a  diagram.  In  which 
CS1  stands  for  either  the  first  CONS  or  SENT  on  the  right- 
hand-side  of  the  formation  rule,  and  CS2  stands  for  the  second 
occurrence.  CS  is  the  result  of  combining  (CS1) (AND) (CS2) . 


Q(CS)L 


Y(CS1) 

Y(CS2) 

Y(CS) 

CS1 

'  r2 

Q(csi) 

i 

N(CS2) 

NfCSllJ 

* 

\ 

N(CS) 

Q(CS). 


AND 

Fig.  HE 

TR  4.6  (with  CONS — >(CONS)  (OR)  (CONS)) .  See  diagram.  Identical  with 


rule  TR  10.6. 

The  next  rule  Is  most  Important. 

TR  9  (with  IMP— >(ANT)(IS)(C0NS)).  The  rule  is: 

Q(IMP)  -  Q(ANT)  Y(ANT)  -  Q(CONS) 

Y(IMP)  -  Y(CONS)  N(ANT)  -  Y(CONS) 

N(IMP)  -  N(CONS)  Ha in  Variable  of  IMP  -  Main.  Var.  of  ANT 

and  the  main  variable  of  IMP  1*  s  wherever  s  appears  in  CONS. 

This  corresponds  to  the  usual  definition  of  Implication,  as  if 
IMP  — (NOT)  (ANT)  (OR)  (CONS)  . 

We  arc  now  in  a  position  to  complete  the  flow  diagram  by  giving 
the  rules  that  go  with  forming  sentences  and  assertive  questions. 


« 
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TR  10.9  (with  SENT — >(QUE)(IMP)> 

Q(SENT)  -  CjCQUE)  Q(IMP)  -  CQ(QUE) 

T(SENT)  -  Y(IMP)  N(IMP)  -  C  (QUE) 

n 

N(SENT)  -  CY(0UE) 

This  rule  should  bind  the  only  remaining  variable. 

TR  10.10  (with  SEMI— ->(QUA)(  IMP)).  Same  as  TR  10.9  with  QUA  In  place 
of  QUE 

We  need  no  translation  rules  for  F  10.1  except  to  relabel  the 
box  from  CONS  to  SENT;  for  F  10.4  the  QTRUE  prefix  Indicates 
Merely  that  we  have  an  assertive  question,  and  that  the  algorltha 
developed  up  to  this  point  applies.  The  translation  rule  to  go 
with  F  5.1  narrows  the  part  of  the  data  base  to  be  searched 
by  search  routines.  This  rule  then  allows  us  to  complete  the 
flow  diagrams  when  we  apply  F  10.2  or  F  10.3. 

TR  10.8  (with  SENT— >  (IF)  (SENT)  (THEN)  (SENT)) 

This  is  best  shown  in  the  diagram,  with  SI  and  S2  denoting  the 
first  and  second  occurrence  of  SENT  to  the  right  of  the  arrow 
in  F  10.8.  It  is  "not  SI  or  S2". 


This  concludes  the  set  of  translation  rules  to  be  used  in  forming  a 
flow  dlagraa  aa  a  by-product  of  parsing  an  assertive  question.  To 
summarise  the  algorltha:  (1)  the  question  la  parsed;  (2)  with  each 
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formation  rule,  starting  first  from  left  to  right,  then  back  from  right 
to  left,  back  and  forth,  a  corresponding  translation  rule  is  applied;  (3) 
the  result  of  applying  a  translation  rule  inserts  a  box  Into  the  flow 
diagram,  assigns  variables  and  constants  to  the  appropriate  search  routines 
(4)  when  parsing  is  completed,  so  is  the  flow  diagram;  all  variables  ere 
quantified  by  Iterative  scans. 

All  Inferences  ere  Inherent  In  the  representation  of  sto.-ed  data, 
not  In  the  source  language  of  questions.  Computing  these  Inferences  Is 
Implicit  In  search  boxes,  like  — i  H(s,o)  and  would  be  the  subject 

of  separate  study. 

Storing  the  translation  rules  so  that  they  can  be  used  to  Implement 
this  algorithm  somewhat  resembles  storing  the  rules  of  formation  and 
transformation.  The  latter  Involve  replacement  instructions,  such  as 
"STRING  'PATTERN’  -  'SUBSTITUTE'"  in  SNOBOL.  The  translation  rules 
Involve  primarily  the  coupling  of  connections;  The  Instructions  would 
resemble  those  for  wiring  a  circuit  or  a  configuration  of  logic  modules. 

To  Illustrate  how  the  algorithm  works  consider  a  typical  assertive 
question.  The  diagrams  show  how  the  flow  diagram  builds  up  as  we  parse. 
Pig.  Ill  shows  the  phrase  marker  and  Fig.  IV  shows  the  completed  flow 
diagram  for  the  program  to  answer  the  question. 
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VI.  QUESTIONS  INVOLVING  "WHAT,”  "WHERE, "  AND  "BOW" 

With  the  preceding  groundwork,  we  are  now  in  a  position  to  pursue 
our  primary  aim  of  showing  that  we  can  build  upon  our  primitive  algorithm 
so  that  it  can  handle  questions  In  an  enriched  source  language.  The 
universe  of  discourse  remains  the  same  and  the  syntax  Is  basically 
unchanged.  The  vocabulary  is  extended  only  slightly,  but  profoundly. 

In  this  section,  we  "enrich"  the  source  language  by  admitting  questions 
like,  "In  Figure  2  what  object  is  darker  than  3.". 

We  first  augment  our  basic  vocabulary  rules. 

(F  1.13)  WHA* — ^vhnt . 

(F  1.14)  WHE  — ^rwhere . 

(F  J.15)  MOW— >hov 

(F  1.16)  REL— >  related  to 

and  our  query-formation  rules 
(F  10.11)  Q — >  (QUA1.)  (WHA)  (IMP) 

(F  10.12)  Q *  <WHA)(IS)(NI) 

(F  10.13)  Q >(WHE)(IS)(NI) 

(F  10.14)  Q  — (HOW) (IS) (NI) (REL) (NI) 

The  translation  rules  accompanying  these  are: 

(TR  1.13)  Form  a  box  for  WHA  as  for  QUE 
(TR  10.11)  Treat  (QUAL) (WHA) (IMP)  as  If  it  were 
(QTURE) (QUAL) (QUE) (IMP)  except : 

(a)  replace  the  "No"  answer  by  "No  such  object" 

(b)  replace  the  "Tee"  answer  by  the  label  of  the  first  object  in  the 
llet  being  searched  which  satisfies  the  query  specification. 
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formation  rule,  starting  first  from  left  to  right,  then  back  from  right 
to  left,  back  and  forth,  a  corresponding  translation  rule  is  applied;  (3) 
the  result  of  applying  a  translation  rule  Inserts  a  box  Into  the  flow 

e 

diagram,  assigns  variables  and  constants  to  the  appropriate  search  routines 
(4)  when  parsing  is  completed,  so  la  the  flow  diagram;  all  variables  are 
quantified  by  Iterative  scans. 

All  Inferences  are  Inherent  In  the  representation  of  stored  data, 
not  In  the  source  language  of  questions.  Computing  these  Inferences  Is 
Implicit  in  search  boxes,  like  — )  H(s,o)  and  would  be  the  subject 

of  separate  study. 

Storing  the  translation  rules  sn  that  they  can  be  used  to  Implement 
this  algorithm  somewhat  resembles  storing  the  rules  of  formation  and 
transformation.  The  latter  Involve  replacement  instructions,  such  as 
"STRING  'PATTERN'  -  'SUBSTITUTE’"  in  SNOBOL.  The  translation  rules 
Involve  primarily  the  coupling  of  connections:  The  Instructions  would 
resenfole  those  for  wiring  a  circuit  or  a  configuration  of  logic  modules. 

To  Illustrate  how  the  algorithm  works  consider  a  typical  assertive 
question.  The  diagrams  show  how  the  flow  diagram  builds  up  as  we  parse. 
Pig.  Ill  shows  the  phrase  marker  and  Fig.  IV  shows  the  completed  flow 
diagram  for  the  program  to  rnswer  the  question. 
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VI.  QUESTIONS  INVOLVING  "WHAT,"  'VHBtE,"  AND  "HOT 

With  the  preceding  groundwork,  we  ere  now  in  a  position  to  pursue 
our  primary  aim  of  showing  that  we  can  build  upon  our  primitive  algorithm 
so  that  it  can  handle  questions  in  an  enriched  source  language.  The 
universe  of  discourse  remains  the  same  and  the  syntax  is  basically 
unchanged.  The  vocabulary  is  extended  only  slightly,  but  profoundly. 

In  this  section,  we  "enrich"  the  source  language  by  admitting  questions 
like,  "In  Figure  2  what  object  is  darker  than  3"- 
We  first  augment  our  basic  vocabulary  rules. 


(F  1.13) 

WHA  — ^  vha  t . 

(F  1.14) 

WHE— >w!icre. 

(F  1.15) 

HOW  — >hov 

(F  1.16) 

REL  — >  related  to 

and  our  query-formation  rules 

(F  10.11) 

Q — >  (QUAL) (WHA) (IMP) 

(F  10.12) 

Q - »  (WHA)  (IS)  (NI) 

(F  10.13) 

Q - >(WHE)(IS)(NI) 

(F  10.14) 

Q  — ^  (HOW)  (IS)  (NI)  (REL)  (NI) 

The  translation  rules  accompanying  these  are: 

(TK  1.13)  Fora  a  box  for  WHA  as  for  QUE 
(T*  10.11)  Treat  (QUAL) (WHA) (IMP)  as  if  it  were 
(QTURE) (QUAL) (QUE) ( IMP)  except : 

(a)  replace  the  "No"  answer  by  "No  such  object" 

(b)  replace  the  "Tea"  answer  by  the  label  of  the  first  object  in  the 
list  being  searched  which  satisfies  the  query  specification. 
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(c)  we  would  also  provide  the  answer  ae  a  complete  declarative 
sentence,  (QUA!) (IMP) ,  In  which  the  word  ‘'object"  la  replaced  by  the 
label  of  the  search  result. 

Example:  "In  Fig.  2_  what  object  Is  darker  than  3"  — ^ 

"In  Figure  2^  5.  Is  darker  than  3".  Certslnly  this  declarative 
sentence  Is  relevant  to  the  question,  whether  or  not  It  is  correct. 

English  usage  would  give  preference  to  "which  object"  over  "what 
object,"  to  connote  that  one  of  a  list  of  objects  is  tc  be  found,  while 
"what"  is  more  appropriate  to,  say,  "What  Is  2-"  Only  the  attributes, 
size,  shape  and  intensify,  but  not  poeltion, are  considered  relevant  to  this 
sense  of  "what."  To  handle  such  a  query,  we  Introduce  the  translation  rule 
TR  10.12.  It  Instructs  to  search  the  entire  data  structure,  through  an 
Index,  for  the  value  of  Nl;  then  to  read  out  all  the  sentences  involving 
S,  D,  C,  T  or  R  (also  using  an  index). 

A  sentence  like  "In  Fig.  ,1  what  Is  an  object  which  If  Inside  a  circle 
Is  above  a  triangle"  la  first  transformed  to:  "In  Fig.  1,  what  object 
which  Is  Inside  a  circle  Is  above  a  triangle."  If  this  second  question 
produces  an  hi,  the  value  of  this  Nl  is  used  to  form  “what  Is  NI."  To 
effect  this  complex  transformation  we  use: 

T  1.3  (QUAL) (VHA) (VAR) (W) (IS) (CONS)  -»  (QUAL) (WHA) (IS) (QUE) (VAR) (W) (IF) (CONS) 
The  question  form  which  is  relevant  to  positional  attributes  (P0SPR2) 
Involves  "where." 

TR  10.13.  This  translation  rule  Is  very  similar  to  that  for  "what."  As 
in  the  case  of  "what,"  it  is  possible  to  transform  a  question  like  "In  Fig.  J. 
what  object  is  inside  a  circle  and  above  a  triangle"  into  "In  Fig.  1^  what 
object  Is  Inside  a  circle  and  above  a  triangle"  by  rule  T  1.3.  As  before, 
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the  name  (say  1)  of  in  object  answering  the  above  description  Is  then 
used  to  form  "Where  Is  2i"  the  answer  to  which  is  provided  by  statements 
like:  H'(2,i.),  V'(2,6),  2  ln  i>  The  answer  to 

"what  is  i"  might  *ook  like:  O' (2,11).  D'(l^,l>,  S'  (1,3),  C(l>. 

To  handle  like  "In  Fig.  1_  is  each  object  which  if  inside  a 

a  circle  is  above  a  triangle, "  consider  a  transformation  into  "In  Fig.  J. 
what  objects  which  are  inside  a  circle  are  above  a  triangle."  This 
transformation.  Involving  inflections,  is  beyond  the  scope  of  this  article. 
The  point,  however,  is  to  use  QUA  instead  of  QUE  when  processing  this  query 
and  produce  all  objects  satisfying  the  query  specification,  not  just  an 
arbitrary  one  -  i.e.,  the  firat  one  encountered  during  llst-seerchlng. 

If  there  is  no  object  meeting  the  query  specifications,  the  response 
is  always  "no  such  object."  If  the  query  had  "the"  in  place  of  "an"  or 
“each  (e.g.,  "In  Fig.  1  **  the  ob3ect  which...")  there  is  an  implied 

assertion  that  there  exists  s  unique  object  meeting  the  query  specification. 
If,  in  fact,  there  is  more  then  one,  then  the  query  was  underspecified. 

If  none,  the  query  was  either  over-specified  or  wrongly  specified. 

To  deal  with  queries  Involving  "when"  and  "how,"  change-sentences  are 
introduced  in  the  next  section.  The  introduction  of  "how  many"  and  "how  much" 
leads  to  a  development  of  sufficient  scope  to  warrant  its  being  reported  as 
e  separate  paper.  But  there  is  one  sense  of  "how"  which  does  not  require 
the  use  of  change-sentences,  exemplified  by:  "How  is  2  related  to  2." 

To  this  we  turn  next. 

T*  10.14.  Search  an  index  for  the  values  of  the  two  HI,  say  2,2.  Then 
reed  out  all  stored  "sentences"  Involving  both,  e.g.:  H'(i,i),  EV(2,2> . 

C(D.  T(i),  0(i.,2).  Questions  in  which  NX  Is  replaced  by  QUANT  sre  ambiguous. 
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A  question  like,  "How  Is  2.  related  to  each  triangle  which  Is  Inside  a 
circle,"  can  be  dealt  with  In  two  ways.  We  first  form  the  question,  "What 

triangles  are  Inside  circles,"  to  get  possible  answers  i^,  f. .  We  now 

interpret  the  original  question  as  either  "How  is  2.  related  to  i.^  and 
and  ..."  or  as  "How  is  2  related  to  i^  and  how  is  2^  related  to  i^  and  ..." 
The  latter  we  can  handle  with  the  algorithm  developed  so  far.  The  former 
requires  special  routines  such  as  developed  in  the  AMNIP  system  (26). 
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VII.  EXTENDING  THE  SOURCE  LANGUAGE  TO  QUESTIONS  ABOUT  CHANGE 


Because  the  same  set  of  Individuals  appear  in  different  relations  in  all 
these  figures,  we  can  speak,  of  changes  in  these  relations.  The  number  of 
each  figure  corresponds  to  the  time  that  the  description  of  a  particular 
configuration  is  recorded  for  storage.  Thus,  at  time  1,  the  data  V  (8,9) 

&  H' (8,9)  &  R(8)  &  R(£)  is  entered  at  address  26;  at  time  2,  the  same  data 
ia  entered  at  address  27;  at  time  3,  the  data,  V'(8,9)  t  H'(8,9)  6  R(8)  &  R(9) 
is  entered  at  address  28,  etc.  Implicit  in  this  data  structure  and/or  the 
procedure  for  searching  it  are  the  statements:  "Fig.  .1  was  recorded  immedi¬ 
ately  before  Fig.  1  +  1”  and  "Fig.  was  recorded  (sometime)  before  Fig.  J." 
if  J>i.  The  predicate  "before"  is  transitive. 

Before  Introducing  tense  into  the  syntax  of  the  EngliBh-like  queries, 
it  is  useful  to  introduce  the  phrases  "changes  occur,"  "from  where"  and  "to 
where"  as  well  as  the  notion  of  a  "standard  sentence."  We  first  formally 
introduce  these  as  terminal  vocabulary  elements: 

F  1.17  CHOC changes  occur 
F  1.18  FRWH— ^  from  wher» 

F  1.19  TOWH  to  where 
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Next  we  introduce  a  new  type  of  sentence: 

F  10.15  SUNT ->  (CHOC)  (PRE)  (POST) 

F  11.1  PFU  -4  (FR'wl)  (SENT) 

F  11.2  POST  -*(T0\'ll)  (f.FNl) 

F  11.3  PRE  — ^  (FRWU) (QUAL) (SENT) 

F  11.4  POST-*(TOWH) (QUAL) (SENT) 

The  translation  rule  corresponding  to  F  11.3  and  F  11.4  la  to  search 
a  list  In  the  location  specified  In  QUAL  as  if  we  had  (QTRUE) (QUAL) (SENT) . 

The  translation  rule  with  F  10.15  Is  to  combine  PRE  with  POST  by  conjunction, 
as  if  It  were  (PRE) (AND) (POST) . 

He  next  Introduce  the  Interrogative  form  with  the  help  of  F  10.4. 

This  produces  a  sentence  like:  "Is  It  true  that  changes  occur  from  where  In 
Fig.  £  £  Is  above  9  to  where  In  Fig.  ^  2.  Is  *bove  8."  The  answer  to  this 
question  Is  "Yes"  If  and  only  If  the  answers  to  both  "Is  it  true  that  In  Fig. 

1  8  is  above  9"  and  "Is  it  true  that  In  Fig.  7  9  is  above  8"  are  "Yes." 

With  rules  F  11.1  and  F  11.2,  the  translation  rules  require  a  search  of 
the  entire  corpus  of  stored  data  for  a  figure  In  which  the  data  specified  by 
SENT  Is  stored.  Thus,  to  answer  "Is  It  true  that  changes  occur  from  where  8 
Is  above  9^  to  where  9  Is  above  (5"  It  is  necessary  to  locate  Figs.  1_,  2,  or  3 
for  PRE  and  6  or  7  for  POST.  Any  of  these  six  pairs  would  produce  a  "Yes"  answer. 

To  compare  two  sentences  describing  changes  we  Introduce: 

F  1.20  WHN  —  when 
F  1.21  AFT  --  after 
F  1.22  BEF  —  before 
F  1.23  WHI  —  while 
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These  new  elements  are  used  as  follows: 


F  10.16  SENT  —  (SENT) (UHN) (SENT) 

This  will  apply  only  when  SENT  on  the  right-hand  side  was  formed  by 
rule  F  10.13.  This  Involves  checking  that  the  same  pair  of  figure  numbers 
occur  in  bo!. a  sentences.  For  example,  consider  the  query:  "Is  it  true  that 
changes  occur  from  where  in  Fig.  1  30  is  above  40  to  where  in  Fig.  j  40  is 

above  30  _ * _ changes  occur  from  where  in  Fig.  u  35  is  inside  4!> 

to  where  in  Fig.  v  figure  35  is  to  the  left  of  45." 

The  answer  is  "Tea"  if; 

*  “  "when";  i  -  u,  J  -  v; 
x  -  "after";  i>v 

*  -  "before";  J  <u 

*  •  "While";  uji,  v£j 

In  all  cases,  i<j  and  u < v. 

Each  of  the  time-relations  couples  the  two  sentences  on  the  right-hand 
side  of  a  rule  like  F  10.16  according  the  above  4  translation  rules.  The 
other  three  corresponding  formation  rules  are  similar  in  form  to  F  10.16  with 
AFT,  BEF  and  WHI  in  place  of  HHN. 

To  deal  with  a  query  like,  "When  do  changes  occur  from  where  30  is  above 
40  to  where  40  is  above  Mi,"  we  Introduce: 

F  1.24  DO  — >do 

F  10.17  Q — ♦  (WHN) (DO) (SENT) 

With  this  rule  goes  a  search  for  two  figures  containing  the  data  specified 
in  the  PRE  and  POST  part  of  SENT  on  the  right-hand  side.  If  the  first 
of  the  two  figures  is  Fig.  £  and  second  Fig.  J_,  the  answer  would  be  "Changes 
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from  where  3(3  is  above  40^  to  where  40  is  above  30  begin  at_i  and  end  at 
We  will  not  introduce  the  necessary  complexities  to  produce  such  a  couplets 
sentence  as  the  answer  at  this  point,  but  accept  (1,J)  as  an  answer. 

To  introduce  verbs  and  tense  inflections  consider: 

F  1.25  STDSENT  •*  tills  sentence  is  uttered 

F  10.18  SENT  - »  (S:-::;T)  (rtlN')  (Sli)SENT) 

Incoming  data,  which  is  to  update  the  total  data  structure.  Is  grouped 
in  expressions  describing  labeled  figures.  The  labels  reflect  when  these 
figure  descriptions  were  stored.  A  question-sentence  posed  to  the  data 
system  can  refer  to  two  such  figure  labels,  one  correspond! v  to  the  last 
figure  stored  prior  to  the  question  being  postd,  the  other  to  the  first 
figure  stored  after  the  query  is  posed.  In  the  following  query-sentence, 
the  pronoun  "this"  refers  to  the  total  (largest)  sentence  of  which  that 
phrase  in  a  clause:  "Is  it  true  that  changes  occur  from  where  in  Fig.  30  8 
is  above  9;  to  where  in  Fig.  31^  8  is  inside  £  when  this  sentence  is  uttered." 
With  this  entire  sentence  go  two  figure  numbers,  i^  and  J_.  The  query  is 
answered  "Tes"  if  i  "  30  and  j  «  31;  otherwise  "No."  The  query  sentence 
is  not  regarded  as  part  of  the  figures  to  which  it  corresponds.  We  now 
extend  this  scheme: 

F  10.19  SENT— »  (SENT)  (AFT)  (STDSENT) 

F  10.20  SENT— » (SENT)  (8EF) (STDSENT) 

F  10.21  SENT — >(SENT) (WHI) (STDSENT) 

To  make  the  resulting  queries  more  English-like,  we  add  several 
transformations : 


» 
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T  2.5  Changes  occur  from  where  x  is  smaller  than  y  to  where  x  is  larger 
than  y — >x  increases  from  where  x  is  smaller  than  y  to  where  x 
is  larger  than  y . 

T  2.6  Changes  occur  from  where  x  is  larger  than  y  to  where  x  is  smaller 

y — >x  decreases  from  where  x  is  larger  than  y  to  where  x  is 

smaller  than  y. 

T  2.7  Changes  occur  from  where  x  is  above  y  to  where  x  is  below  y — > 

x  moves  from  where  x  is  above  y  to  where  x  is  below  y. 

T  2.8  (x)  (Vs)  (PRE)  (POST)  (l.-dL)  (STliSKNV) — »  (x)  (Vs)  (PRE)  (POST) 

Here  V  stands  for  "increase,"  "decrease,"  "move."  While  (x) (y) 
has  stood  for  the  concatenation  of  x  and  y  with  a  space  in  between. 
Vs  stands  for  cc; catenating  s  right  after  V  without  a  space. 

T  2.9  (x) (Vs) (PRE) (POST) (AFT) (STDSENT)  — »  (x) (wi 1 1) (V) (PRE) (POST) 

T  2.10  (x) (Vs) (PRE) (POST) (BEF) (STDSENT)  — »  (x) (Vd) (PRE) (POST) 

T  2.11  (x) (Vs) (PRE) (POST) (WHI) (STDSENT)  — »  (x)  (is)  tyfing)  (PRE)  (POST) 

The  i  Indicates  that  If  V  ends  In  e,  this  e  is  to  be  deleted 
prior  to  concatenating  "ing"  without  a  space  the  V  stripped  of  e.  We 
treat  "will’’,  "d,",  "ding”  as  terminal  vocabulary  elements  but  do  not 
produce  them  from  non-terminal  symbols. 

To  deal  with  a  t/  estlon  like,  "How  did  2.  move  from  where  in 
Fig.  1_  2  is  above  2  to  where  in  Pig.  2_  is  below  5.,"  we  first  consider 
a  transformation  from  the  variant  of  the  sentences  produced  by  T  2.10 
T  2.12  (H0W(x)(Vd) (PRE) (POST)  — *  (HOW) (did) (x) (V) (PRE) (POST) 

The  rule  for  flow-diagram  construction  Is  to  treat  the  question 
as  if  it  were  a  search  for  all  figures  between  Fig.  _1  and  Fig.  '±  if  these 
are  specified  as  part  of  PRE  and  POST,  and  a  search  for  all  figures 
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from  Fig.  1_  to  Fig.  2  (Inclusive)  if  these  are  not  specified  In  the  ques¬ 
tion.  The  answer  Is  to  be  In  the  form  of  a  step-by-step  description  of 
how  the  motion  occurred.  Insofar  as  data  about  all  these  steps  Is  recorded. 

Similarly,  we  have: 

T  2.13  (HOV.’Hx)  (is)(V£ngJPRE)(rOST)  — *  (HOW)  (Is)  (x)  (Vding)  (PRE)  (POST) 

T  2.14  (HOW)  (x)(wlll)(V)  (PRE)  (POST)  — >  (HOW)  (wl  11)  (x)  (V)  (PRE)  (POST) 

It  is  not  necessarily  the  case  that  all  consecutive  figure  num¬ 
bers  between  the  one  (explicitly  or  implicitly)  of  PRE  and  that  of  POST 
pertain  to  the  question.  All  of  them  must,  however,  be  examined.  If,  in 
two  successive  figures  ,  all  the  three  objects  mentioned  In  the  above  ques¬ 
tion-example  have  not  changed  their  relative  position,  then  this  need  not 
be  reported  in  the  answer.  The  answer  need  only  renort  what  changes  took 
place  and  when. 
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VIII.  CONCLUSION 


This  paper  describes  a  procedure  for  translating  English-like  questions 
about  simple  pictures  into  flow  diagrams  for  computer  programs  which  would 
answer  the  question.  What  Is  new  and  important  about  this  algorithm  is 
that  it  can  be  cumulatively  augmented  to  extend  the  English-like  source 
language  of  questions,  not  in  syntax  or  domain  of  discourse,  but  in  vocabu¬ 
lary  representing  fundamental  concepts  common  to  all  questions. 

It  would  be  ideal  to  be  able  to  state,  as  a  theorem,  the  precise 
limits  on  the  source  language.  In  the  conventional  sense,  the  grammar  C 
presented  here  specifies  the  source  language  L^  syntactically,  but  this  is 
not  very  pertinent.  In  the  absence  of  techniques  for  proving  a  theorem 
about  the  extent  to  which  Lg  Includes  significant  que6Cion-types,  a  sampling 
of  sentences  within  (and  outside)  the  content  scope  of  the  algorithm  will 
have  to  serve  as  an  adequate  means  of  conveying  to  the  reader  an  impression 
of  the  extent  of  this  scope. 

The  following  set  of  ten  sentences,  combined  with  sentences  E  1, 

...,  E  10  in  the  text,  provide  a  sampling  of  positive  Instances. 

E  11  How  did  20  move  from  where  in  Fig.  ,1  20  is  above  30  to  where  in 
Fig.  5  20  is  below  30. 

E  12  How  Is  _5  moving  from  where  _5  is  inside  J5  to  where  £  is  enclosing  £. 

E  13  How  will  6  move  from  where  in  Fig.  1.  6  is  to  the  left  of  to 

where  6  is  to  the  right  of  7_. 

E  14  Hot  is  a  triangle  which  Is  inside  5  increasing  from  where  it  is 
Just  smaller  than  7  to  where  it  is  just  larger  than  8. 

E  15  When  did  changes  occur  from  where  1_  is  Ju*t  darker  than  7.  to 
where  1  is  just  darker  than  3. 
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E  16  Is  it  true  that  a  triangle  which  Is  larger  than  each  triangle  which 
is  inside  a  circle  increases  from  where  it  is  smaller  than  a  tri¬ 
angle  inside  n  rectangle  to  wiigre  it  is  greeter  than  a  triangle 
which  is  inside  a  triangle. 

E  17  Where  Is  a  circle  which  Is  inside  a  circle  which  is  above  a  rec¬ 
tangle  which  la  inside  a  triangle  which  la  larger  than  each  triangle 

which  is  above  each  circle. 

E  18  How  la  a  triangle  inside  a  circle  related  to  a  triangle  thlch  la  not 
inside  an  object. 

E  19  Is  it  true  that  if  fig.  X  no  object  la  inside  an  object. 

E  20  la  it  true  that  in  Pig.  l_  if  no  object  is  inside  an  object  then 

in  Fig.  no  object  la  enclosing  an  object. 

With  some  additional  transformation  rules,  the  following  sentences  could 

readily  be  added ;  though  they  are  at  present  negative  instances. 


E  21  What  is  the  largest  object  in  Fig.  1. 

E  22  When  will  2  be  Just  above  3. 

E  23  How  will  2  get  to  where  2  is  Just  above  .3. 


The  following  illustrate  a  few  sentences  outside  the  content-scope  of  this 

algorithm,  though  they  are  syntactically  of  the  same  type  and  pertain  to  the 
same  domain  of  discourse. 

F  1.  How  many  triangles  are  in  Fig.  J. 

F  2.  Why  did  l,  move  from  where  1_  was  sbove  ,2  to  where  1  was  below  2. 

F  3.  How  can  1.  move  from  where  l  is  above  2.  to  where  1_  is  below 

F  4.  Is  it  true  that  .1  1*  large  and  dark 

P  5.  How  srnch  larger  than  1,  Is  2,. 

t  6.  Is  it  true  that  _1  Is  related  to  2  as  3.  is  related  to  4. 

F  7.  What  Is  a  triangle. 
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One  might  characterize  the  typical  program  represented  by  the  flow 
diagrams  that  result  from  this  translation  algorithm  as  that  of  nested 
loops  involving  search  routines.  Fig.  IV  is  a  representative  flow  diagram. 
There  is  relatively  little  computation,  mostly  string  matching,  string 
replacement,  and  string  definition.  Self-modifying  programs  would  not 
be  produced.  Yet  the  range  of  programs  represented  by  the  flow  diagrams 
la  quite  larg.e,  for  particular  strategies  for  searching  the  data  base  and 
Indexes  to  it,  for  making  Inferences  and  for  parsing  most  efficiently  are 
not  specified. 

Moreover,  the  efficiency  of  any  program  depends  critically  upon  how 
the  diagram  descriptions  are  represented  for  computer  storage  and  used  by 
the  program,  and  how  the  entire  data  base  is  structured.  But  this,  es  well 
as  the  problem  of  how  to  relate  portions  of  Incomplete  diagrams,  require 
separate  investigation.  We  cannot,  therefore,  assess  the  efficiency  of  the 
class  of  programs  represented  by  our  flew  diagrams. 

The  evidence  in  the  form  of  examples  of  questions  the  algorithm  can 
process  supports  the  conclusion  that  the  English-like  source  language  of 
auch  questions  has  a  potentially  wide  scope  in  that  it  Includes  many  of  the 
fundamental  question-types. 


>4 
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